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SELECTIVE LABELING AND ISOLATION OF PHOSPHOPEPTIDES AND 
APPLICATIONS TO PROTEOME ANALYSIS 

STATEMENT REGARDING U.S. GOVERNMENT RIGHTS 
This invention was made with U.S. Government support under National Science 
Foundation grant No. BIR9214821 AM04 and under National Institutes of Health grant no. 
LR33CA84698. The United States government has certain rights in this invention. 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application takes priority from U.S. provisional application serial no. 
60/210,972, filed June 12, 2000, which is incorporated by reference herein in its entirety. 

FIELD OF THE INVENTION 
A general method for selective chemical labeling of phosphate groups is provided 
which facilitates highly specific purification of molecules containing one or several 
phosphate group(s). The method is applicable to the selective labeling of phosphates in 
phosphoproteins and phosphopeptides. When combined with mass spectrometric techniques, 
the method can be employed to detect and identify phosphorylated proteins in complex 
mixtures and to precisely identify the phosphorylated amino acid. The invention has 
application in the field of proteomics where it facilitates the quantitative, global analysis of 
protein phosphorylation in a cell or tissue. 

BACKGROUND OF THE INVENTION 
Proteins are essential for the control and execution of virtually every biological 
process. The rate of synthesis and the half-life of proteins and thus their expression level are 
also controlled post-transcriptionally. Furthermore, the activity of proteins is frequently 
modulated by post-translational modifications, in particular protein phosphorylation, and 
dependent on the association of the protein with other molecules including DNA and 
proteins. Neither the level of expression nor the state of activity of proteins is directly 
apparent from the gene sequence or even from the expression level of the corresponding 
mRNA transcript. A complete description of a biological system must therefor include 
measurements that indicate the identity, quantity and the state of activity of the proteins 
which constitute the system. The large-scale (ultimately global) analysis of proteins 



expressed in a cell or tissue has been termed proteome analysis (Pennington, S. R., Wilkins, 
M. R., Hochstrasser, D. F., and Dunn, M. J. (1997), "Proteome analysis: From protein 
characterization to biological function," Trends Cell Bio. 7:168-173). 



At present no protein analytical technology approaches the throughput and level of 
5 automation of genomic technology. The most common implementation of proteome analysis 
is based on the separation of complex protein samples, most commonly by two-dimensional 
gel electrophoresis (2DE), and the subsequent sequential identification of the separated 
protein species (Ducret, A. et al. (1998), "High throughput protein characterization by 
automated reverse-phase chromatography/electrospray tandem mass spectrometry," Prot. Sci. 

10 7:706-719; Garrels, J. I. et al. (1997), "Proteome studies of Saccharomyces cerevisiae: 

identification and characterization of abundant proteins. Electrophoresis," 75:1347-1360; 
Link, A. J. et al. (1997), "Identifying the major proteome components of Haemophilus 
influenzae type-strain NCTC 8143," Electrophoresis 75:1314-1334; Shevchenko, A. et al. 
(1996), "Linking genome and proteome by mass spectrometry: large-scale identification of 

1 5 yeast proteins from two dimensional gels," Proc. Natl. Acad. Sci. U.S.A. 93:14440-14445; 

Gygi, S.P. et al. (1999), "Correlation between protein and mRNA abundance in yeast," Mol. 
Cell. Biol. 19:1720-1730; Boucherie, H. et al. (1996), "Two-dimensional gel protein database 
of Saccharomyces cerevisiae," Electrophoresis 17:1683-1699). 

The 2DE approach has been revolutionized by the development of powerful mass 
20 spectrometric techniques and computer algorithms which correlate protein and peptide mass 
spectral data with sequence databases and, thus, rapidly and conclusively identify proteins 
(Eng, J., McCormack, A., and Yates, J. I. (1994), "An approach to correlate tandem mass 
spectral data of peptides with amino acid sequences in a protein database," J. Am. Soc. Mass 
Spectrom. 5:976-989; Mann, M., and Wilm, M. (1994), "Error-tolerant identification of 
25 peptides in sequence databases by peptide sequence tags," Anal. Chem. (5(5:4390-4399; Yates, 
J. R. et al. (1995), "Method to correlate tandem mass spectra of modified peptides to amino 
acid sequences in the protein database," Anal. Chem. (57:1426-1436). 

This technology has reached a level of sensitivity which now permits the 
identification of essentially any protein which is detectable by conventional protein staining 
30 methods including silver staining (Figeys, D., and Aebersold, R. (1998), "High sensitivity 
analysis of proteins and peptides by capillary electrophoresis tandem mass spectrometry: 
Recent developments in technology and applications," Electrophoresis 79:885-892.; Figeys, 
D. et al. (1998), "Electrophoresis combined with mass spectrometry techniques: Powerful 
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tools for the analysis of proteins and proteomes," Electrophoresis 19:1811-1818; Figeys, D. et 
al (1997), "A microfabricated device for rapid protein identification by microelectrospray ion 
trap mass spectrometry," Anal. Chem. 69:3153-3160; Figeys, D. et al. (1996), "Protein 
identification by solid phase micro extraction-capillary zone electrophoresis- 
5 microelectrospray-tandem mass spectrometry," Nature Biotech. 74:1579-1583; Shevchenko, 
A. et al. (1996), "Mass spectrometric sequencing of proteins silver- stained polyacrylamide 
gels," Anal. Chem. 65:850-858). However, the sequential manner in which samples are 
processed limits the sample throughput. The most sensitive methods have been difficult to 
automate and low abundance proteins, such as regulatory proteins, escape detection without 
10 prior enrichment, thus effectively limiting the dynamic range of the technique. In the 2DE- 
based approach proteins are quantified by densitometry of stained spots in the 2DE gels. 

The development of methods and instrumentation for automated, data-dependent 
--- electrospray ionization (ESI) tandem mass spectrometry (MS n ) in conjunction with 

microcapillary liquid chromatography (uLC) and database searching has significantly 
I I' increased the sensitivity and speed of the identification of gel-separated proteins. As an 
v alternative to the 2DE/MS n approach to proteome analysis, the direct analysis by tandem mass 
spectrometry of peptide mixtures generated by the digestion of complex protein mixtures has 
J been proposed (Dongr'e, A. R. et al. (1997), "Emerging tandem-mass-spectrometry 
0 = techniques for the rapid identification of proteins," Trends Biotechnol. 15:4 1 8-425). uLC- 
ZQ Ms/MS has also been used successfully for the large-scale identification of individual 
t" proteins directly from mixtures without gel electrophoretic separation (Link, J. et al. (1999), 
" "Direct analysis of large protein complexes using mass spectrometry," Nat. Biotech.l 7:676- 
682; Opiteck, GJ. et al. (1997), "Comprehensive on-line LC/LC/MS of proteins," Anal. 
Chem. 69:1518-1524.) 

25 While these approaches dramatically accelerate protein identification, the quantities of 

the analyzed proteins cannot be easily determined due to the observation that mass 
spectrometers are inherently not quantitative devices. Direct mass spectrometric analysis of 
protein mixtures by mass spectrometry can be made quantitative by the application of stable 
isotope dilution theory, whereby two chemically identical analytes (one representing an 

30 internal standard and the sample to be measured) are labeled with stable isotope tags of 

identical chemical composition but different mass. This principle has been implemented in 
quantitative proteome analysis by the development of a class of chemical reagents termed 
isotope coded affinity tags (ICAT). (Gygi, S.P. et al. (1999), "Quantitative analysis of 
complex protein mixtures using isotope-coded affinity tags," Nat. Biotechnol. 17, 994-999.) 
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ICAT reagents and their application to the analysis of complex protein mixtures have been 
shown to substantially alleviate the dynamic range problem encountered by the 2DE/Ms n 
approach. 

Protein phosphorylation is one of the most important regulatory events in cells. The 
5 state of activity of numerous enzymes and processes and the association of specific proteins 
into functional complexes are frequently controlled by reversible protein phosphorylation 
(Graves, J.D. & Krebs, E.D. (1999), "Protein phosphorylation and signal transduction," 
Pharmacol. Ther. 82, 111-121; Koch, C. A. et al. (1991), " SH2 and SH3 domains: elements 
that control interactions of cytoplasmic signaling proteins," Science 252, 668-674; Hunter, T. 

10 (1994), "1001 protein kinases redux-towards 2000," Semin. Cell Biol. 5, 367-376). The 
principle goals of studying protein phosphorylation are the identification, quantitation and 
determination of the biological function of phosphorylation site(s) in phosphoproteins. Much 

£ of the difficulty in such studies lies in the fact that many phosphoproteins exist at only at very 
low abundance. Further, proteins are often phosphorylated at a low stoichiometry and at 

15 multiple sites. Therefore, it is usually difficult to obtain sufficient amounts of pure 
phosphoprotein for such analyses. All current methods for the analysis of the 
phosphorylation state of proteins focus on one purified phosphoprotein at a time (Verma, R. 

- . et al. (1997), "Phosphorylation of Siclp by Gl Cdk required for its degradation and entry into 

- S phase," Science 278, 455-60; Watts, J.D. et al. (1994), "Identification by electrospray 
20 ionization mass spectrometry of the sites of tyrosine phosphorylation induced in activated 

- Jurkat T cells on the protein tyrosine kinase ZAP-70," J. Biol. Chem. 269, 29520-29529; 
Gingras, A.C. et al. (1999), "Regulation of 4E-BP1 phosphorylation: a novel two-step 
mechanism," Genes Dev. 13, 1422-1437). Because cellular proteins are coordinately 
phosphorylated to control specific biological processes, the complex mechanisms that 

25 control biological systems by protein phosphorylation are difficult to investigate using 
current technology. 

Because phosphopeptide(s) typically are infrequent and of low abundance in protein 
digests, highly purified or enriched phosphopeptide samples are needed for mass 
spectrometric analysis. The need to selectively enrich for phosphopeptides prior to MS 
30 analysis is particularly urgent if a protein mixture rather than a single purified phosphoprotein 
is being analyzed. In addition, no MS-based method to quantify protein phosphorylation 
directly is currently available. Quantitative study of protein phosphorylation often involves 
methods such as 32 P radiolabeling (Oda, Y. et al. (1999), "Accurate quantitation of protein 
expression and site-specific phosphorylation," Proc. Natl. Acad. Sci. USA 96:6591-6596). 
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Therefore, an MS-based method that allows both the identification of the sites of 
phosphorylation from complex mixtures of proteins and their quantitation will be an 
essential part of proteome analysis. 

Thus, there is a substantial need in the art for a more rapid and general method for the 
analysis of protein phosphorylation, particularly in complex protein mixtures, that does not 
require purification to homogeneity of individual phosphoproteins. The present invention 
provides such a method. 

SUMMARY OF THE INVENTION 
This invention provides a method for selective labeling of phosphate groups in natural 

and synthetic oligomers and polymers in the presence of chemically related groups such as 

carboxylic acid groups. The method is specifically applicable to biological oligomers and 

polymers, including phosphopeptides, phosphoproteins and phospholipids. Selective labeling 

of phosphate groups in proteins and peptides, for example, facilitates separation, isolation and 

detection of phosphoproteins and phosphopeptides in complex mixtures of proteins. 

Selective labeling can be employed to selectively introduce phosphate labels at phosphate 

groups in an oligomer or polymer, e.g., in a peptide or protein. Detection of the presence of 

the label, is used to detect the presence of the phosphate group in the oligomer or polymer. 

The method is of particular use for the detection of phosphoproteins or phosphopeptides. The 

phosphate label can be a colorimetric label, a radiolabel, an isotope label, a fluorescent or 

phosphorescent label, an affinity label or a linker group carrying a reactive group (or latent 

reactive group) that allows selective attachment of the oligomer or polymer (protein or 

peptide) to a phosphate label, to an affinity label or to a solid phase material. 

Selective attachment of a phosphate group to its affinity label or selective attachment 
to a solid support allows selective isolation of an oligomer or polymer (e.g., protein or 
peptide) that carries at least one phosphate group. The presence of an affinity label allows 
capture of the selectively labeled oligomer or polymer using a capture reagent that 
specifically bonds to the affinity label. The presence of a linker that allows selective covalent 
attachment of the oligomer or polymer to a solid surface allows physical separation of the 
selectively labeled oligomer or polymer from non-selectively labeled (non-phosphorylated) 
species in a sample. The method is particularly useful for the selectively labeling of proteins 



and peptides having a phosphate group (phosphoproteins and phosphopeptides) in the 
presence of carboxylic acid groups. The method of this invention can be used specifically to 
separate phosphoproteins and phosphopeptides from mixtures of non-phosphorylated proteins 
and/or nonphosphorylated peptides and thus to overcome detection problems due to low 
5 levels of these species in samples. 

In specific embodiments the method is applied to the separation, detection and 
identification of phosphopeptides and phosphoproteins in one or more samples. The method 
can simply be used to selectively introduce a phosphate label that allows detection of the 
presence of one or more phosphate groups in a peptide or protein. The method can also be 
10 used to selectively introduce an affinity label at a phosphorylation site in a peptide or a 
protein or selectively bond a phosphoprotein or phosphopeptide to a solid surface. 

5. When combined with well-known methods of mass spectrometry, the selective 

" labeling method of this invention facilitates separation of phosphopeptides from mixtures and 
facilitates their detection by mass spectrometry and facilitating sequencing of the peptides by 
16 tandem mass spectrometry. Art-known methods can be applied to identify phosphoproteins 
i in a sample from the sequences of phosphopeptides detected in a sample. When combined 
Z with methods for differential isotopic labeling, the method of this invention can be employed 
K to quantify relative amounts of phosphopeptides and phosphoproteins in different samples. 
^ r These quantitative methods allow a comparison of phosphorylation state in samples derived 
20 from different sources (e.g., from different cell-types or different organisms), samples that are 
differentially affected by a stimulus (e.g., administration of a drug or contact with a 
potentially toxic material), by a change in environment (e.g., nutrient level, temperature, 
passage of time) or by a change in condition or cell state (e.g., disease state, malignancy, site- 
directed mutation, gene knockouts) of a cell, tissue or organism from which the sample 
25 originated. The phosphoproteins identified in such a screen can function as markers for the 
changed state. Phosphopeptides and phosphoproteins from any naturally-occurring 
environment or artificially-controlled environment can be assessed by the methods herein. 
The method can be applied to mixtures of naturally-occurring proteins or peptides, as well as 
to mixtures of proteins or peptides derived from recombinant or synthetic methods. 
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The selective labeling method of this invention comprises the following steps as 
applied to phosphopeptides and phosphoprotein: 

(1) the carboxylic acid groups in proteins or peptides in one or more samples 
containing proteins are permenently selectively protected so that any 
phosphate groups in the proteins or peptides in the sample (s) are unprotected 
(remain as free phosphate groups); 

(2) the free phosphate groups in the proteins or peptides of the sample(s) are then 
selectively reacted with a label (e.g., a phosphate label, a radio label, isotope 
label, or an affinity label) or with a linker carrying a reactive group or a latent 
reactive group that will facilitate selective attachment of phosphoproteins or 
phosphopeptides to a label or to a solid surface; and 

(3 a) selectively labeled proteins or peptides are detected using the presence of the 
label as a measure of the presence of at least one phosphate group; or 

(3b) selective attachment of peptides or proteins to an affinity label or to a solid 
surface to facilitate selective isolation of phosphopeptide and /or 
phosphoproteins separated from proteins and peptides that do not have a 
phosphate group. 

In a preferred embodiment, selective labeling of phosphate groups is accomplished by 
initial reaction of the proteins and peptides in one or more samples with a protective group, 
such as an amine, that reacts in the presence of a condensation catalyst with and protects both 
carboxylic acid groups and phosphate groups. Amines react with carboxylic acid (or the 
corresponding esters) to form carboxyamidate bonds. Amines react with phosphates or 
phosphate ester groups to form phosphoramide bonds. The labeled phosphoramide bonds in 
the protected proteins and peptides are then selectively cleaved with a reagent that does not 
cleave carboxy amide bonds. This results in the regeneration of free phosphate groups which 
can be selectively labeled or linked to a solid surface. In a specific embodiment, an amine, 
such as ethanolamine, can be used for initial protection of all carboxylic acid and all 
phosphate groups. For example, a carbodiimide catalyzed condensation of an amine with the 



peptide or protein forms amide and phosphoramide bonds. Ethanolamine can be selectively 
cleaved from the phosphate groups of the protein or peptide by treatment with mild acid 
conditions (e.g., trifluoroacetic acid (tfa), 10-30% by volume in water, exemplifying mild 
acid conditions). Excess protecting agent (e.g., excess amine) can be removed by extensive 
washing of the peptides on a reverse phase column. In specific embodiments, the free 
phosphate is reacted with a linker group that carries a reactive functional group including a 
latent reactive group (such as a sulfhydryl group) that can be used to attach the 
phosphoprotein or phosphopeptide to a solid support or used for selective labeling of 
phosphopeptides and phosphoproteins with a phosphate label. 

For example, carbodiimide-catalyzed condensation reactions can be used to attach cystamine 
to free phosphate groups. The disulfide bond in the cystamine can be cleaved to generate a 
reactive sulfhydryl group. (Cystamine is an example of a group that carries a latent reactive 
group). 

Phosphopeptides or phosphoproteins that have been selectively separated and isolated 
from other proteins or peptides in a sample are cleaved from the affinity label or solid support 
and analyzed by conventional mass spectrometric techniques including tandem mass 
spectrometry to detect, identify by sequencing or quantitate the phosphopeptides or 
phosphoproteins in one or more samples. 

Proteins and/or peptides in different samples can be differentially isotopically labeled 
to facilitate comparison of amounts of the same peptide or protein in different samples. 
Isotopic labeling is typically introduced at the carboxylic acid protective groups (e.g., at 
amine groups such as ethanolamine). 

In this method, amine groups in the proteins or peptides are preferably also treated 
with a protective group selective for reaction with the amine side groups of amino acids (e.g., 
the s-amino group of lysine side chains, and/or peptide a-amino groups). This treatment 
controls crosslinking of amine side chains during sample preparation. If sulfhydryl linker 
groups are employed, it is preferred that the samples be treated with a reagent that reduces 
disulfide bonds. Selectively labeled samples are also optionally treated with hydroxyamine 
to remove tyrosine adducts that may have formed during sample preparation. 



This invention also provides kits for selective labeling of phosphate groups which 
contain reagents needed to carry out selective labeling and optionally contain positive and 
negative controls for use with the kit reagents. A reagent kit comprises a carboxylic 
acid/phosphate reactive protective group and a label or linker group that is to be selectively 
bonded to the phosphate groups in the phosphoproteins or phosphopeptide. The kit also 
contains any catalysts or condensation agents needed to facilitate the reaction (e.g., 
carbodiimide). In addition, the kit optionally includes a reagent for selective cleavage of 
phosphoramide bonds in the presence of amide bonds (e.g., a dilute acid that can be used to 
generate the mild acidic conditions for selective cleavage). Kits can include reagents in 
premeasured aliquots for assay of a selected number of samples. 

The label may be an affinity label and, if so, the kit preferably contains a capture 
reagent appropriate for use with the affinity label. A kit optionally contains protective groups 
for amines (e.g., t-boc or f-moc), and solid phase materials. The kit may further contain a set 
of differentially isotopically labeled protective groups, linkers, affinity labels, or other labels 
(fluorescent, chromophoric or phosphorescent) to allow quantitative determination of the 
amounts (or relative amounts ) of phosphoproteins and phosphopeptides in different samples. 
With respect to fluorescent, chromophoric, radiolabels or other labels, different types of 
labels can be used to label the phosphates in different samples. For example, different 
fluorescent labels that are separately detectable and can be individually quantitated (e.g., 
fluorescein amine, rhodamine amine) can be used to label different samples and to detect 
relative amounts of labeled peptides in difference samples. Kits further optionally contain 
instructions for carrying out selective labeling, as well as, directions for conducting various 
types of analysis that can be used in combination with the kit to detect, identify, or quantitate 
phosphopeptides and phosphoproteins. 

In specific embodiments this invention provides: 

A method for selectively labeling or tagging phosphate groups in one or more natural 
or synthetic peptides or proteins in the presence of one or more carboxylic acid groups by 
reacting the natural or synthetic peptides or proteins with a protective group that reacts to 



protect the phosphate groups therein by forming phosphoramide bonds and to protect the 
carboxylic acid groups therein by forming amide bonds; thereafter treating the protected 
peptide or protein under conditions which selectively substantially cleave the phosphoramide 
bond, without substantially cleaving the amide bond to regenerate free phosphate groups in 
5 the peptide or protein; and reacting the free phosphate groups in the peptide or protein, in 
which the carboxylic acids groups remain protected, with a label or tag comprising a 
functional group that reacts with a phosphate or with a linker that comprises two or more 
functional groups that function to bond a phosphopeptide or phosphopeptide to a solid 
support. 

10 Reagents which substantially cleave one bond in the substantial absence of cleavage 

in the other exhibit at least about a 10:1 ratio of cleavage of one bond to another (measured in 
terms of the rate of reaction or the amount of cleavage product detected) and preferably 
exhibit at least about a 20: 1 ratio, and more preferably, at least about a 1 00: 1 ratio of cleavage 
of one bond to the other. Of course for applications to the methods herein, the reagents for 

15 selective cleavage of bonds are preferably chosen to cleave one bond without any measurable 
cleavage of the other bond. 

In this method the phosphopeptide or phosphoproteins can be covalently attached to a 
solid support material through reaction with a sulfhydryl group of the linker and the solid 
support can comprise immobilized iodoacetyl groups for reaction with sulfhydryl groups. In 
20 this method phosphopeptides or phosphoproteins an be separated from a mixture by 

attachment to a solid support or by binding of the phosphopeptides to a capture reagent via an 
affinity label. 

A method for detecting one or more phosphopeptides in one or more samples 
containing a mixture of peptides by selectively protecting the carboxylic acid groups of the 
25 peptides in the one or more samples such that any phosphate groups in the peptides remain 
unprotected; selectively labeling the unprotected phosphate groups in the peptides in the 
sample with a label having a functional group that reacts directly or indirectly with a 
phosphate; detecting the peptides carrying the label to detect the phosphopeptides in the 
sample. The label can be a radiolabel, an isotope label, a fluorescent label, a colorimetric 
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label or an affinity label, the label can also be a reactive label which carries at least one 
reactive group or at least one latent reactive group. A latent reactive group is a group that 
must be activated for reaction, e.g., it can be a group that carries a protective group and which 
becomes reactive on removal of the protective group. 

In this method tandem mass spectrometry can be used to determine the amino acid 
sequence of peptides and the precise position of the phosphorylated amino acid within the 
peptide sequence. Quantitation of the relative amounts of phosphopeptide can be 
accomplished by use of differentially isotopically labeled labels or tags. Tandem mass 
spectrometry can also be used to detect one or more phosphopeptides in a sample and 
determine the relative amounts of one or more phosphopeptides in the two or more samples 
by measuring the relative amounts of differentially isotopically labeled labels or tags present 
in the one or more samples. 

The method also provides kits for selectively labeling phosphopeptides in a mixture 
of peptides by reacting the peptides in one or more samples with a protective group which 
reacts with a carboxylic acid or ester thereof and which also reacts with a phosphate group; 
and employing an acid reagent for selectively regenerating free phosphate groups in the 
peptide by reacting the protected peptides under sufficiently mild acid conditions such that 
the phosphoramide bond is substantially cleaved and the amide bond is substantially not 
cleaved. The kit can further comprise any one or more of the following: a radiolabel, a stable 
isotope label, a fluorescent label, a colorimetric label, an affinity label, a capture reagent with 
a corresponding affinity label, a reactive label, protective groups for amine groups, one or 
more solid supports, an iodoacetylated solid support, one or more enzymes for carrying out a 
protein digest; and reagents for carrying out the various enzymatic or chemical reactions of 
the detection or separation methods herein. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 is a scheme illustrating the chemistry of phosphopeptide/phosphoprotein 
labeling and purification of this invention. 
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Fig. 2 is a flow chart illustrating quantitative comparison of phosphopeptides and 
phosphoproteins in two different samples. 

Fig. 3 illustrates the results of mass spectrometric analysis of differentially labeled 
samples of phosphoangiotensin as described in Example 1 . 

5 Figs. 4A-D illustrate the results of application of the phosphopeptide isolation 

strategy of this invention to the phosphoprotein p-casein. A tryptic digest of p-casein was 
analyzed by LC-MS/MS both before (Figs. 4A and 4C) and after (Figs. 4B and 4D) 
phosphopeptide isolation according to the procedure of this invention. 10 pmol of starting 
material was used for phosphopeptide isolation. Fig. 4A is an ion chromatogram of 1 pmol of 
1 0 p-casein digest before phosphopeptide isolation. The peak at m/z =1 03 1 .6 represents the 

i doubly charged form of the expected phosphopeptide of the p-casein tryptic digest. Fig. 4B 
is an ion chromatogram of the isolated phosphopeptides of the p-casein tryptic digest. The 
peak at m/z = 1 1 82.5 represents the doubly charged form of the same tryptic phosphopeptide 
from p-casein, indicated in Fig. 4A, but additionally modified on its seven carboxylate groups 
15 with ethanolamine. Fig. 4C is the CID spectrum of p-casein digest in Fig. 4B. The peak at 

= m/z = 938.0 represents the doubly charged form of the selected parent ion (m/z = 1031.6) 

minus the H 3 P0 4 group. Fig. 4D is the CID spectrum of isolated phosphopeptides of the p- 

: casein digest in Fig. 4C. Again, the peak at m/z = 1 133.6 represents the double charged form 

of the selected parent ion (m/z = 1 182.5) minus H 3 P0 4 , and the y-ion series used for peptide 
20 identification is indicated. The b-ion series are much less intense and are omitted for clarity. 

Figs. 5A and 5B illustrate phosphopeptide isolation from the Lck tyrosine-myelin 
basic protein (MBP) kinase reaction mixture. Fig. 5 A is an LC-MS ion chromatogram of the 
phosphopeptides isolated from a tryptic digest of protein mixture generated from in vitro 
kinase reaction between Lck and MBP. Fig. 5B is a CID mass spectrum of the most intense 
25 ion (m/z = 630.1, 2+ ion). This peak was subjected to CID analysis and database searching, 
which identified the peptide as THY*GSLPQK from MBP, with the tyrosine residue being 
phosphorylated. 
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Figs. 6 A-C illustrate the results of phosphopeptide isolation from a yeast cell lysate. 
Fig. 6A is an LC-MS chromatogram of the phosphopeptides isolated from a tryptic digest of 
whole yeast cell lysate. Fig. 6B is an integrated mass spectrum of ions eluting from the LC 
column with retention times between 24.7 and 26.5 min, as indicated in Fig. 6A Major ion 
5 peaks that additionally exhibited a loss of 98 Da on CID, indicating that they are 

phosphopeptides, are annotated with an asterisk (*). Fig. 6C is the CID spectrum recorded 
for the peptide peak indicated in Fig. 6B at m/z ± 1032.7. This spectrum was sufficient to 
identify the phosphopeptide as TAGIQIVADDLT*VT*NPAR from enolase. However, the 
exact site of threonine phosphorylation was not unambiguously defined because of the 

10 difficulty in assigning y5 and y6 ions. Both potential locations for the phosphate have thus 
been indicated (*), although the parent ion mass confirms the peptide as a singly 

rj phosphorylated species. 
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DETAILED DESCRIPTION OF THE INVENTION 
This invention provides methods for detecting the presence of phosphopeptides and 
phosphoproteins in a mixture, identifying phosphopeptides and phosphoproteins that are 
present in a mixture and determining the relative amounts of phosphopeptides and 
5 phosphoproteins in one or more mixtures. The methods are based on the ability to selectively 
from covalent bonds to phosphate groups of peptides in the presence of carboxylic acid (or 
ester) groups and amine groups of peptides. The methods are more specifically based on the 
ability to selectively bond a label or linker to a phosphate group in the presence of carboxylic 
acids. In particular, the method relies on the ability to cleave phosphoramide bonds under 

10 mild acid conditions which do not cleave amide bonds. Thus, selective labeling or linking to 
a phosphate group in the method proceeds by initial conversion of peptide and protein 
carboxylic acid groups (of the C-terminus and amino acid side groups) to amides and the 
conversion of phosphate groups of peptides and proteins to phosphoramidate. Thereafter, the 
phosphoramidate are selectively cleaved without cleaving the amides and the free phosphate 

1=5 groups are reacted with selected labels or linkers to facilitate detection, identification and 
quantitation of phosphopeptides in one or more samples. 

r A preferred method for forming amide and phosphoramide bonds is by condensation 

with free amines. Condensation can be accomplished using various condensation catalysts 
known in the art, but the use of carbodiimide is a preferred method. In general any amine can 

20 be employed but alkanol amines, such as ethanolamine are preferred. The amine can function 
simply as a protecting group to facilitate selective reaction with phosphate groups, it can 
carry a detectable label, e.g., carry a group or moiety that can be detected, such as a 
radiolabel, a fluorescent label or the like, or it can carry a group or moiety that can react (a 
reactive functionality), a group that can be made to react (a latent reactive group) or that can 

25 form a bond or complex to another species (e.g., an affinity label which will bond or complex 
to a capture reagent). 

Phosphoramide bonds are selectively cleaved in the presence of amide bonds by use 
of mild acid conditions. Both the strength of the acid and the length of time of expose to 
acidic conditions can be controlled to obtain selective cleavage of the phosphoramide bonds. 
30 A preferred treatment uses dilute trifiuoroacetic acid, e.g., 10% or less by volume in water for 
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a selected time up to several hours. Thereafter the free-phosphate group can be reacted with a 
variety of labels or linkers. For example, the presence of a phosphopeptide in a sample can 
be detected by the presence of a selected label that has been selectively attached to a 
phosphate group. Phosphopeptides can be separated from non-phosphorylated peptides by 
5 selective attachment to a solid support, for example, via a linker group that is selectively 

attached to the phosphate group. Selective attachment of phosphopeptides to a solid support 
can be used to isolate and purify phosphopeptide in one or more samples and facilitate their 
detection and identification by analytical methods, particularly by mass spectrometry. 

10 In a specific embodiment of the methods herein, a peptide mixture, which may be 

generated from a variety of natural or synthetic sources, is treated to protect amine groups in 
O the peptides (N-terminal as well as those in amino acid side groups). Amines are preferably 

protected using t-Boc chemistry as is known in the art. Carboxylic acids and phosphate 
J- groups of amine-protected peptides are then condensed with free amine, preferably 
IS ethanolamine (which may be isotopically labeled) to form amides and phosphoramidates, 

respectively. Differential isotopic labeling of different samples of peptides is accomplished 
:: by treating different samples with differentially labeled amines, e.g., non-deuterated 
C : ethanolamine can be used for one sample and deuterated ethanolamine can be used for a 

second sample. Other stable isotope encoded amine reagents can be used. 

20 The peptides which are amine-protected and in which the carboxylic acid groups and 

any phosphate groups have been converted to amides and phosphoramidates, respectively, are 
then treated under mild acid conditions to selective cleave the phosphoramide bonds leaving 
the amide bonds substantially intact. Any phosphopeptides in a sample carry free phosphate 
groups and the phosphopeptides in different samples are differentially isotopically labeled. 

25 The phosphopeptides in each sample can then be selected attached to a solid support, via a 
sulfhydryl linker generated by condensation of the phosphate groups with cystamine. The 
attached cystamines are then reduced to generate free sulfhydryl groups which can react with 
iodoacetyl groups on a solid support. The cystamine functions as a latent reactive group 
which is activated for attachment to the solid support by reduction (e.g., with dithiothreitol, 

30 DTT or tris[2-carboxlethylphosphine], TCEP). 
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Phosphopeptides in samples are attached to the solid support and after extensive 
washing are cleaved off the support (e.g., using trifluoroacetic acid). Preferably, the amine 
protective groups are cleaved in this reaction as well. These steps provide purifies, 
isotopically labeled phosphopeptides which ca be subjected to tandem mass spectrometric 
5 analysis. CID mass spectra provide the sequence of any phosphopeptide present in a sample 
and indicate the presence and location of phosphorylated amino acid residues. Using the 
peptide sequence information obtained, database searching can be performed to determine the 
protein source of the phosphopeptides detected. The relative signal intensities of 
differentially isotopically labeled phosphopeptides in different samples detected in mass 
1 0 scans in the same mass spectrometer allow measurement of the ratio of abundance of the 
labeled phosphopeptides in different samples. 

z_ The methods of this invention have several advantages. Amine groups of the peptides 

Z are protected and do not form amide bonds with the carboxylic acid groups of other peptides 
minimizing peptide cross-linking and other artifactual reactions. The carboxylic acid groups 
1-5 remain protected and thus can provide for differential isotope labeling. 

r Covalent linkage of the amine labeling the phosphate group to a solid support is a 

2 preferred implementation of the method of this invention because it allows stringent washing 
of the immobilized phosphopeptides and their specific release by acid treatment. The use of 
cysteine to label phosphate groups is preferred because any peptide with residual unblocked 

20 carboxylic acid groups, whether it contains phosphate or not, will be converted into 

sulfhydryl groups and remain attached to the solid support irreversibly. The presence of 
cysteine residues interferes with the above method, however, cysteine groups can be 
optionally alkylated by iodoacetamide of any other known alkylating reagent. Before any 
protein sample is subjected to this method of phosphopeptide isolation, it can be reduced by 

25 dithiothreitol, and then alkylated by excess of iodoacetamide under denaturing condition. 

The solid phase- based embodiment of the method of this invention therefore serves as an 
efficient way for highly specific phosphopeptide purification and the stable isotope tags 
introduced by carboxyl group derivatization serves as the basis for relative phosphopeptide 
quantitation. Any solid material that can be derivatized with functional groups that facilitate 

30 attachment of the phosphopeptides (e.g., via sulfhydryl groups) and that is otherwise 
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relatively inert with respect to the peptides, reagents and washing conditions of the method 
can be used. For example, any solid phase material that is useful in solid phase peptide 
synthesis can be employed. Glass beads are a preferred solid phase material. 

With respect to fluorescent, chromophoric, radiolabels or other labels, different types 
of labels can be used to label the phosphates in difference samples. For example, different 
fluorescent labels that are separately phosphate and can be individually measured (e.g., 
fluorescein amine, rhodamine amine) can be used to label different samples and to detect 
relative amounts of labeled peptides in different samples. These labels can also be used to 
separate the peptides by RP-HPLC or CE (capillary electrophoresis) and to detect the relative 
amounts of peptides by fluorescence measurement. To implement quantitative measurement 
of relative amounts of peptides in difference samples, it is preferred to calibrate the system to 
account for differences in detection of the different labels. For example, is it preferred when 
using different fluorescent labels to calibrate for differences in quantum yields of different 
labels. 

Figure 1 illustrates the selective labeling method of this invention. As illustrated, 
peptides (1) are first reacted with an amine protective group (2), such as tBoc (t- 
Butoxydicarbonate). A variety of useful amine protective groups are known in the art and 
readily available for application in this method. The protective group selected must be 
compatible with other reagents used in the method. Peptides with protected amine groups 

(3) are then treated with a reagent that reacts to protect carboxylic acid and phosphate groups 

(4) , such as an amine. An amine group will react with a carboxylic acid group to form an 
amide (-CO-NH-) bond or with a phosphate group to form a phosphoramidate (-P0 2 -NH-) 
bond. The fully protected peptide (5) is treated to selectively cleave the phosphoramide bond 
to remove protection from the phosphate group. Mild acid conditions (6) are used to 
selectively regenerate the phosphate group. For example, the protected peptide can be treated 
with trifhioroacetic acid (tfa) (about 30% or less by volume in water for about 1 hour, 
preferably about 20% in water for about 1 hour, and more preferably 10% for 30 min. at room 
temperature). Mild acidic conditions include the use of a strong acid, such as tfa, in diluted 
form. Other mild acidic conditions that will achieve the desired chemistry can be determined 
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by routine experimentation which treatment is varied to maximize phosphoramide cleavage 
and minimize amide cleavage. 

Protected peptides in which the phosphate protective groups have been removed (7) 
are treated to selectively label the free phosphate group. For example, a linker containing a 
sulfhydryl (8) or other reactive group can be selectively bonded to the phosphate groups. The 
linker group contains functionality for bonding to the phosphate group (e.g., an amino group) 
and functionality for linking to a label or to a solid surface. Alternatively, at this point in the 
method, a label (an affinity label or any phosphate label such as a fluorescent label or 
radiolabel) carrying functionality for bonding to the phosphate group can be directly attached 
to the phosphate. As illustrated in Fig. 1, the functionality on the linker group may be a latent 
functional group which must be activated prior to reaction to add a label or to covalent attach 
to a solid surface. In the illustrated case, cystamine carries a latent sulfhydryl functional 
group ( a -S-S- bond) which is activated by addition of a reducing agent (9, such as TCEP or 
DTT) that reduces disulfide bonds. 

Figure 1 illustrates covalent linkage of phosphopep tides via the sulfhydryl functional 
group on the linker. The peptides carrying the sulfhydryl group linker (10) are reacted with 
derivatized beads carrying iodoacetyl groups (11). A variety of types of solid materials are 
available for use in this method. Solid phase materials (in the form of beads, surfaces, layers, 
etc.) are derivatize to facilitate linkage. After reaction with phosphopeptide (10), remaining 
reactive groups on the solid can be capped or protected with an appropriate capping reagent 
(e.g., with DTT (12)). Phosphopeptides covalently linked to the solid support (13) can be 
physically separated from other peptides in the samples and any non-specifically attached 
peptide can be removed with washing. 

Figure 1 illustrates a step of treating the bound phosphopeptides with hydroxylamine 
(14). More specifically, beads carrying bound peptide (13) are washed and incubated in 1M 
hydroxylamine (14, pH 10.0) for about 2 hrs. to regenerate tyrosine, since tyrosine residues 
can form adducts with carbodiimide. This step is optionally, but preferred, when 
carbodiimide reagents are employed in the method, to regenerate tyrosines in the bound 
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peptide which may have formed adducts with the carbodiimide reagent. It has been found that 
treatment with 5% hydroxylamine for about 30 min. is sufficient to regenerate tyrosine. 

Unbound peptides are removed by sequential washing with 5M sodium chloride, 
acetonitrile and water. The volume of each washing step will minimally consist of about 10 

5 column bed volumes, but larger volumes can also be employed. The bound phosphopeptides 
are then treated under strong acid conditions (15) to cleave the linker to the solid surface and 
to remove the amine protective group (2). The carboxylic acid groups of the free (i.e., 
released) phosphopeptide (16) are, however, still protected. These protective groups can be 
employed to carry differential isotopic labels that are useful for quantitative peptide analysis 

10 by mass spectrometry. It may also be desirable to employ conditions that will selectively 
cleave the phosphopeptide from the solid support without cleavage of the amine reactive 
protective group (2).. A variety of protective groups which are cleaved using a variety of 
different conditions are available in the art. One of ordinary skill in the art can select a 
protective group, a linker and cleavage conditions that will allow retention of an amine 

15 reactive protective group on cleavage of the linker from the solid support material. If the 
amine reactive protective group is retained after cleavage of the peptide from the solid 

f- surface, it can also be used for differentially isotopic labeling. 

Dried separated phosphopeptides are resuspended in water for LC-MS analysis 
20 Phosphopeptides selectively labeled and isolated by the method herein are preferably 
analyzed by mass spectrometric techniques. Protective groups and linkers used in this 
method are preferably selected so that any modifications to the peptide that are retained on 
release from the solid phase material do not significantly interfere with mass spectral analysis 
and sequencing of the peptide by tandem mass spectrometric methods. 

25 U.S. patent application serial number 09/383,062 filed August 25, 1999, and 

corresponding International Patent Application W099/19415, filed August 25, 1999, having 
inventors in common with this application, provide analytical reagents and mass 
spectrometry-based methods for the rapid, and quantitative analysis of proteins or protein 
function in mixtures of proteins. The methods employ reagents designated "affinity-labeled 

30 protein reactive reagents" that allow for the selective isolation of a peptide from complex 
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mixtures. The reagent contains an affinity label covalently linked through a linker group to a 
protein reactive group that selectively reacts with certain protein functional groups. The 
linker may be differentially isotopically labeled. The reagents and methods can be applied to 
the detection and identification of proteins in complex mixtures of proteins, where the 

5 peptides isolated by the method are characteristic of the presence of a protein in the mixture. 
Isolated peptides are characterized by mass spectrometric (MS) techniques. In particular, the 
sequence of isolated peptides can be determined using tandem MS (MS n ) techniques, and by 
application of sequence database searching techniques, the protein from which the sequenced 
peptide originates can be identified. The affinity-labeled protein reactive reagents can also 
10 provide for differential isotopic labeling of the isolated peptides to facilitate quantitative 
determination of the relative amounts of proteins in different samples and provide for 
internal standards to facilitate quantitative determination of the absolute amounts of one or 

5 more proteins present in a sample. The present invention provides a method for selective 

0 labeling of phosphopeptides in complex mixtures and their selective isolation that can be 

1 jf employed with the methods and applications described in U.S. patent application serial no. 
W 09/383/062 US and International Patent Application WO 99/19415. These patent 

D applications are incorporated by reference herein in their entirety to the extent that they are 

not inconsistent with the disclosures herein, among other things for descriptions of 
fU differential isotopic labeling, mass spectrometric methods and application of the selective 
2Dl labeling methods described. 

The following references relate to the application of mass spectrometric techniques to 
protein identification, particularly with respect to proteome analysis: Ideker T, Thorsson V, 
Ranish JA, Christmas R, Buhler J, Eng JK, Bumgarner R, Goodlett DR, Aebersold R, Hood 
L "Integrated genomic and proteomic analyses of a systematically perturbed metabolic 

25 network." Science. 2001 May 4;292(5518):929-34; Gygi SP, Aebersold R. "Mass 

spectrometry and proteomics." Curr Opin Chem Biol. 2000 Oct;4(5):489-94.; Gygi SP, Rist 
B, Aebersold R "Measuring gene expression by quantitative proteome analysis" Curr Opin 
Biotechnol." 2000 Aug;ll(4):396-401; Goodlett DR, Bruce JE, Anderson GA, RistB, Pasa- 
Tolic L, Fiehn O, Smith RD, Aebersold R.'Trotein identification with a single accurate mass 

30 of a cysteine-containing peptide and constrained database searching."Anal Chem. 2000 Mar 
15;72(6):1 1 12-8.; and Goodlett DR, Aebersold R, Watts JD."Quantitative in vitro kinase 
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reaction as a guide for phosphoprotein analysis by mass spectrometry."Rapid Commun Mass 
Spectrom. 2000;14(5):344-8; Zhou, H. et al (Apr 2001) Nature Biotechnol. 19:375-378. 
These reference are incorporated by reference herein to the extent that they are not 
inconsistent with the disclosure of this application. 

Peptide mixtures subjected to the method of this invention can be generated from 
natural or synthetic samples and may be the results of chemical, physical or enzymatic 
digestion of protein samples. Proteins can be digested using any enzymatic appropriate 
method, such as trypsin digestion. Peptides in the digest preferably range in size from about 
10 to about 50 amino acids in length and are more preferably sized to facilitate peptide 
sequencing using tandem mass spectrometric methods. Those of ordinary skill in the art can 
select a protein digestion protocol suitable for use in protein sample(s) of interest. 

Amines are preferred reagents for selective labeling of carboxylic acids in the 
presence of phosphate groups. Any amine reagent will generally provide the function of this 
protective group. Alkanolamine, such as ethanolamine are preferred amine reagents. One of 
ordinary skill in the art will appreciate that other reagents can be found which provide a 
similar selective labeling function. One or ordinary skill in the art can identify and selection 
other reagents for selectively labeling without resort to undue experimentation. The 
protective group employed must, however, also be suitable for use in the reactions of this 
invention. Any art-known methods and reagents suitable for selective protection of 
carboxylic acid groups as described herein are intended to be encompassed by this invention. 
With respect to amines that are used to selectively protect carboxylic acid groups, the reaction 
of amines with carboxylic acids and phosphate groups is preferably done in the presence of a 
coupling agent. Coupling agents that can be used in this reaction include, among others, 
dicyclohexylcarbodiimide, or 2,3,5,6-tetrafluorophenyl trifluoroacetate. In addition, a 
coupling catalyst such as 4-dimethylaminopyridine can be employed. 

The selective labeling method of this invention employs a treatment that selectively 
removes protection from phosphate groups, but not carboxylic acid groups. In particular, the 
protected peptides are treated with acidic conditions that cleave phosphoamidate bonds, but 
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not amide bonds (herein mild acidic conditions). Treatment under these mild acidic 
conditions cleaves the phosphoamide bonds between phosphate and ethanolamine without 
deprotecting the amine and carboxylic acid groups of the peptide. For example, tBoc 
protection remains mostly intact, unless prolonged acid treatment is involved. Those of 
ordinary skill in the art will appreciate that other treatment conditions may be found which 
provide for a similar function. One or ordinary skill in the art can identify and select other 
reagents for selectively removal of protecting groups on phosphates without resort to undue 
experimentation. Any such methods and reagents known in the art which achieve the 
selective removal of phosphate protective groups as described herein are intended to be 
encompassed by this invention. 

If desired, selectively labeled peptides can be attached to a solid support by attaching 
a linker group carrying an appropriate functional group for bonding to the support. 
Attachment of phosphopeptides to solid supports is exemplified by attachment via sulfnydryl 
group reaction with iodine. Those of ordinary skill in the art appreciate that functional 
groups other than sulfnydryl and iodine can be used to complete a linkage to a solid support 
material. A variety of methods for making such attachments are known in the art. Any 
method and reagents that achieve the function of selective attachment of the phosphopeptide 
to a solid support are intended to be encompassed by this invention. 

The method of this invention as specifically exemplified employs steps of washing 
peptides on Reverse Phase columns to remove undesired materials from the peptide sample. 
Those of ordinary skill in the art will appreciate that methods for removing such materials 
other than those specifically described herein are known in the art and can be readily applied 
to the method herein to achieve the desired result. All such art-known methods for washing 
or removal of undesired materials are intended to be encompassed by this invention. 

A strategy for quantitative, comparative analysis of protein phosphorylation in one or 
more samples is illustrated in Figure 2. Peptide samples are prepared from two cell states (1 
and 2). As an example, differentially isotopically labeled carboxylic acid/ phosphate group 
amine protective reagents (one for each sample, e.g., dO- or d4-ethanolamine for each of two 
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samples) are used to differentially isotopically label all carboxylic acid groups and initially 
any phosphates in peptide samples. Carboxyl groups include the C-terminus of a peptide and 
the side chains of glutamic acid and aspartic acid residues (as well as any carboxylic acid 
side groups from rare or non-naturally-occurring amino acids). Ethanolamine groups 
5 protecting the phosphate groups are selectively removed to generate free phosphate groups. 
Free phosphate groups are then derivatized with a linker that facilitated separation of 
phosphopeptides. For every phosphopeptide ultimately purified using this method, there is at 
least one labeled protective group (e.g., ethanolamine) attached to the C terminus of the 
peptide. Dependent upon its structure, a given phosphopeptide may have more than one 
10 lab eled protective group . 

The mass difference between differentially isotopically labeled peaks will depend on 
the isotopic mass difference among the labels and the charge state of the peptides, which can 
~ be determined in the mass spectrometer itself based on the natural isotope distribution. Since 

isotopic-related peptides essentially co-elute from a microcapillary high performance liquid 
15 column chromatography (HPLC) run, as they are analyzed by the mass spectrometer, 
^ multiplets of peaks due to a given peptide appear for each differentially labeled peptide (e.g., 
~ a doublet for two samples labeled differentially with dO- and d4-ethanolamine). The relative 
intensity of the peaks in the multiplet (e.g., doublet) of peaks from the same peptide in 
differentially isotopically labeled samples directly yields the relative concentrations of that 
20 peptide in the different samples. The underlying principle of this quantification method is 
that isotopically related peptides are chemically identical and therefore represent a perfect 
mutual internal standard. The intensities of the signals generated in the mass spectrometer 
from the differentially isotopically labeled peptides from different samples precisely reflect 
relative quantities of the peptide molecules present in those samples, respectively. 

25 The sequence of a phosphopeptide and the identification of the site(s) of 

phosphorylation can be determined by a combination of tandem mass spectrometry and 
computer-assisted database search programs, such as SEQUEST (Trademark, University of 
Washington, Seattle WA) (McCormack, A.L. et al. (1996) "Direct Analysis and Identification 
of Proteins in Mixtures by LC/MS/MS and Database Searching at the Low-Femtomole 
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Level", Anal. Chem. 69, 767-776; Eng, J.K. et al. (1994)" An Approach to Correlate Tandem 
Mass Spectral Data of Peptides with Amino Acid Sequences in a Protein Database" J. Amer. 
Soc. Mass. Spectrom., 5, 976-989; U. S. Patent 5,538,897 (Jul. 23, 1996) Yates, III et al.) In 
the first stage of a tandem mass spectrometer, any given phosphopeptide is selected and 
subjected to collision induced dissociation (CID). The spectra of a resulting fragment ion is 
recorded in the second stage of the mass spectrometry, as a so-called CID spectrum. This 
process is repeated with other (ideally all) peptides present in the sample. Because the CID 
process usually causes fragmentation at peptide bonds and different amino acids for the most 
part yield peaks of different masses, a CDI spectrum alone often provides enough information 
to determine a peptide sequence. Peptide sequencing and protein identification is facilitated 
by using a sequence searching computer program, such as SEQUEST™, which takes all 
known genomic sequence, computes all possible theoretical CID spectra and compares them 
to experimental CID spectra for matches and sequence identification. The mass modification 
to the C-terminus, glutamic acid, aspartic acids and any other acidic side groups are known 
and this information can be incorporated into the computer analysis. Also mass changes due 
to phosphorylation are also known and can be incorporated into the computer analysis. Data 
can be searched for any possible phosphorylations to serine, tyrosine, and threonine residues, 
thus allowing the identification of sites of phosphorylation. 

The methods of this invention can employ protective groups which are isotopically 
labeled to generate pairs or sets of reagents that are substantially chemically identical, but 
which are distinguishable by mass. For example a pair of protective group reagents, one of 
which is isotopically heavy and the other of which is isotopically light can be employed for 
the comparison of two samples, one of which may be a reference sample containing one or 
more known proteins in known amounts. For example, any one or more of the hydrogen, 
nitrogen, oxygen or sulfur atoms in the protective group may be replaced with their 
isotopically stable isotopes: 2 H, 13 C, 15 N, 17 0, ls O or 34 S. Differential isotopic labeling is 
preferably introduced into the peptides of this invention in the carboxylic acid protective 
group. 
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The methods of this invention can use affinity labels or phosphate labels for the 
detection or separation of phosphopeptide. The methods of this invention can use any 
phosphate label, including, but not limited to radiolabels, fluorescent labels, colorimetric 
labels, etc. The label is selectively attached to a phosphate in a phosphopeptide in the sample 
and its detection detects the presence of the phosphate. 

Suitable affinity labels bind selectively either covalently or non-covalently and with 
high affinity to a capture reagent (CR). The CR-A interaction or bond should remain intact 
after extensive and multiple washings with a variety of solutions to remove non-specifically 
bound components. The affinity label binds minimally or preferably not at all to components 
in the assay system, except CR, and does not significantly bind to surfaces of reaction 
vessels. Any non-specific interaction of the affinity label with other components or surfaces 
should be disrupted by multiple washes that leave CR-A intact. Further, it must also be 
possible to disrupt the interaction of A and CR to release peptides, substrates or reaction 
products, for example, by addition of a displacing ligand or by changing the temperature or 
solvent conditions. Preferably, neither CR or A react chemically with other components in 
the assay system and both groups should be chemically stable over the time period of an 
assay or experiment. The affinity label is preferably soluble in the sample liquid to be 
analyzed and the CR should remain soluble in the sample liquid even though attached to an 
insoluble resin such as Agarose or controlled pore glass. With respect to CR, the term 
soluble means that CR is sufficiently hydrated or otherwise solvated such that it functions 
properly for binding to A. CR or CR-containing conjugates should not be present in the 
sample to be analyzed, except when added to capture A. Affinity labels useful in this 
invention contain functionality that allows them to be bonded (preferably via a covalent bond 
which can be selectively cleaved, if and when desired) to a phosphate group. 

Examples of A and CR pairs include: 

d-biotin or structurally modified biotin-based reagents, including d-iminobiotin, 
which bind to proteins of the avidin/streptavidin, which may, for example, be used in the 
forms of strepavidin- Agarose, oligomeric-avidin- Agarose, or monomeric-avidin- Agarose; 
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any 1,2-diol, such as 1 ,2-dihydroxyethane (HO-CH 2 -CH 2 -OH), and other 1,2- 
dihyroxyalkanes including those of cyclic alkanes, e.g., 1 ,2-dihydroxycyclohexane which 
bind to an alkyl or aryl boronic acid or boronic acid esters , such as phenyl-B(OH) 2 or hexyl- 
B(OEthyl) 2 which may be attached via the alkyl or aryl group to a solid support material, 
such as Agarose; 

maltose which binds to maltose binding protein (as well as any other sugar/sugar 
binding protein pair or more generally to any ligand/ligand binding protein pairs that has 
properties discussed above); 

a hapten, such as dinitrophenyl group, for any antibody where the hapten binds to an 
anti-hapten antibody that recognizes the hapten, for example the dinitrophenyl group will 
bind to an anti-dinitrophenyl-IgG; 

a ligand which binds to a transition metal, for example, an oligomeric histidine will 
bind to Ni(II), the transition metal CR may be used in the form of a resin bound chelated 
transition metal, such as nitrilotriacetic acid-chelated Ni(II) or iminodiacetic acid-chelated 
Ni(II); 

glutathione which binds to glutathione-S-transferase. 

The covalent attachment of A to CR can be accomplished, for example, by the 
reaction of iodoacetamide on CR with a sulfhydryl group on A. 

In general, any A-CR pair commonly used for affinity enrichment which meets the 
suitability criteria discussed above can be employed. Biotin and biotin-based affinity tags are 
preferred. Of particular interest are structurally modified biotins, such as d-iminobiotin, 
which will elute from avidin or strepavidin columns under solvent conditions compatible 
with ESI-MS analysis, such as dilute acids containing 10-20% organic solvent. It is expected 
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that d-iminobiotin tagged compounds will elute in solvents below pH 4. d-Iminobiotin 
tagged protein reactive reagents can be synthesized by methods described herein for the 
corresponding biotin tagged reagents. 

A displacement ligand, DL, is optionally used to displace A from CR. Suitable DLs 
are not typically present in samples unless added. DL should be chemically and 
enzymatically stable in the sample to be analyzed and should not react with or bind to 
components (other than CR) in samples or bind non-specifically to reaction vessel walls. DL 
preferably does not undergo peptide-like fragmentation during MS analysis, and its presence 
in sample should not significantly suppress the ionization of tagged peptide, substrate or 
reaction product conjugates. 

DL itself preferably is minimally ionized during mass spectrometric analysis and the 
formation of ions composed of DL clusters is preferably minimal. The selection of DL, 
depends upon the A and CR groups that are employed. In general, DL is selected to displace 
A from CR in a reasonable time scale, at most within a week of its addition, but more 
preferably within a few minutes or up to an hour. The affinity of DL for CR should be 
comparable or stronger than the affinity of the tagged compounds containing A for CR. 
Furthermore, DL should be soluble in the solvent used during the elution of tagged 
compounds containing A from CR. DL preferably is free A or a derivative or structural 
modification of A. Examples of DL include, d-biotin or d-biotin derivatives, particularly 
those containing groups that suppress cluster formation or suppress ionization in MS. 

The method of this invention can employ linker groups which bond to phosphate 
groups to attach the phosphopeptide to a solid support. A linker can also be used to attach an 
affinity label or phosphate label to a phosphopeptide. Any linker used should preferably be 
soluble in the sample liquid to be analyzed and it should be stable with respect to chemical 
reaction, e.g., substantially chemically inert, with components of the sample as well as with 
any other reagents used in the method. The linker when bound to the peptide should not 
interfere with the specific interaction of an affinity label with a CR and should bind 
minimally or preferably not at all to other components in the system, to reaction vessel 
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surfaces or CR. Any non-specific interactions of the linker should be broken after multiple 
washes. 

Samples that can be analyzed by methods of this invention include cell homogenates; 
cell fractions; biological fluids including urine, blood, and cerebrospinal fluid; tissue 
homogenates; tears; feces; saliva; lavage fluids such as lung or peritoneal lavages; mixtures 
of biological molecules including proteins, lipids, carbohydrates and nucleic acids generated 
by partial or complete fractionation of cell or tissue homogenates. 

The methods of this invention employ mass spectrometric and tandem mass 
spectrometric methods. While a variety of MS methods are available and may be used in 
these methods, Matrix Assisted Laser Desorption Ionization MS (MALDI/MS) and 
Electrospray Ionization MS (ESI/MS) methods are preferred. 

The method of this invention is illustrated and exemplified by application to the 
detection and identification of phosphopeptides in a phosphopeptide standard sample, 
(Example 2, Fig. 3A-C); an enzymatic digest of a single phosphoprotein, (3-casein (Example 
3, Fig. 4A-4D); a protein phosphorylated in vitro at a tyrosine residue (Example 4, Fig. 5A- 
C), a digest of total lysate of yeast cells (Example 5, Figs. 6A-C and Table 1); and a digest of 
total lysate of Jurkat cells (Table 2). 

The following examples are intended to further illustrate the invention and are not 
intended to limit the invention. 
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EXAMPLES 
Example l:Phosphopeptide Isolation Procedure 

Peptide samples were dried, and then subjected to the method shown in Fig. 1A 
according to the following steps. 1) Peptide mixture was resuspended in 50% (v/v) of 0.1 M 
phosphate buffer (pH ll)/acetonitrile. 0.1 M of t-Butyl-dicarbonate (tBoc) was added for 4 
hours at room temperature. 2) Acetonitrile was removed under reduced pressure. Samples 
were made to 1 M ethanolamine, 25 mM N-hydroxysuccinimide (NHS) and 0.5 M of N,N'- 
dimethylaminopropyl ethyl carbodiimide* HC1 (EDC) and incubated 2 hours at room 
temperature. 3) 10% trifluoroacetic acid (TFA) was added for 30 minutes at room 
temperature. Longer treatment under these conditions did not detrimentally effect the results. 
Samples can be neutralized at this point, but neutralization was found to have no significant 
effect on results. Samples were then desalted on and recovered from a CI 8 column (Waters 
Associates, Milford, MA WAT 023590) using elution with 80% acetonitrile, 0.1% TFA. 4) 
Peptides were dried and redissolved in 1 M imidazole (pH 6.0). Imidazole is optional and 
was employed to inhibit possible carbodiimide adduct formation with sensitive amino acids 
such as histidine. 0.5 M EDC was added for 3 hours at room temperature. Samples were 
loaded on a CI 8 column, washed with water and treated with 1 M cystamine (pH 8.0) for 2 
hours at 50 °C on the column. Peptides were washed with water and reduced with 10 mM 
DTT to generate free sulfhydryl groups. 5) After washing to remove DTT, peptides were 
eluted with 80% acetonitrile, 0.1% TFA and incubated with 20 mg beads with immobilized 
iodoacetyl groups for at least 2 hours at pH 8.0 (titrated with 1 M Tris pH 8.0, 50 mM 
EDTA). Beads with immobilized iodoacetyl groups were prepared by a 2-hour reaction 
between 3 equivalents of iodoacetic anhydride and 1 equivalent of amino beads (Sigma, 
G4643) with 3.3 equivalent of diisopropylethylamine in dimethylformide. The formation of a 
tyrosine adduct with carbodiimide is a possible side reaction. Such an adduct is unstable 
against nucleophiles such as hydroxylamine. Therefore, after attachment of phosphopeptide 
to the beads, 1 M hydroxylamine (pH 10) was used to incubate beads for 2hours at room 
temperature. This restored tyrosine residues. It has been found that treatment with 5% 
hydroxylamine solution for 30 min. is typically sufficient to restore tyrosine residues. Beads 
were then washed sequentially with 2 M NaCl, methanol and water to remove nonspecifically 
bound molecules. 6) The beads were incubated with 100% TFA for 30 minutes to recover 
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phosphopeptides. Concurrently, tBoc protection was removed. The recovered sample was 
dried under reduced pressure and resuspended in water for LC-MS/MS analysis. 

Example 2 

Two separate samples of equal amounts of phospho angiotensin peptide were analyzed 
5 by the method of this invention. The carboxylic acid groups in the two different samples 

were blocked (leaving phosphate groups free as described above) by either light ethanolamine 
(dO-ethanolamine) or heavy ethanolamine (d4-ethanolamine, HOCD 2 CD 2 NH 2 ). 
Phosphoangiotensin contains two carboxylic acid groups, so that the mass difference for the 
[M+2H] 2+ ion is 4 for the differentially labeled peptides. The results of mass spectrometric 
10 analysis of the differentially labeled samples that were subjected to selective labeling and 
separation of phosphopeptides of this invention is illustrated in Figs. 3A-C. A doublet of 
peaks [M+2H] 2+ at m/z= 607 and 611, due to light and heavy labeled samples, respectively, is 
- observed as expected. Further the relative ratios of the two peaks is about 1:1 as expected. 
Z The CID spectrum of each of these peaks is similar to that of the unprotected peptide, except 
15 for the fragment ions that are mass shifted by the modification (label attachment). 

Modifications to the carboxylic acid groups used to achieve differential labeling do not 
adversely affect the quality of the CID spectrum that would be used to identify the sequence 
Z of unknown peptides . 

Phosphopeptide isolation was performed essentially as in Example 1 with minor 
20 modifications. Peptides were suspended in 50% (v/v) acetonitrile and 0.1 M potassium 
phosphate buffer (pH 1 1). t-Butyloxy-carbonyl (t-Boc, 1M) was added for 4 h at room 
temperature. Acetonitrile was removed under reduced pressure. The amine-protected peptide 
was dried under reduced pressure and resuspended in 1 M ethanolamine HC1. The pH of the 
solution was adjusted to approximately 6 by addition of 50 mM N-hydroxy succinimide 
25 (NHS). N,N'-demethylaminopropylethyl carbodiimide-HCl (EDC) was added (5 mg/50uL) 
for 4 h at room temperature. Amine-protected peptide was labeled with the heavy 
ethanolamine (d4-ethanolamine, HOC0 2 CD 2 -NH 2 ) in a similar manner. 

Phosphates were selectively deprotected in the differentially isotopically labeled 
peptides by mixing each solution containing the labeled peptide with an equal volume of 20% 
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(v/v) trifluoroacetic acid for 1 h and thereafter neutralizing the reactions using 2 M sodium 
phosphate buffer. The neutralized solution was loaded on a reverse phase CI 8 column and 
washed extensively with water. The peptide with regenerated phosphate groups was eluted 
from the C-18 column using 80% (v/v) acetonitrile in water and then dried. Samples of 
5 heavy and light isotopically labeled peptide were combined for mass spectrometric analysis. 

An LCQ ion trap mass spectrometer (Finnigam MAT, San Jose, CA) was used with 
an HP 1 100 solvent delivery system (Agilent, Palo Alto, CA). Peptides were pressure-loaded 
onto the column, then eluted and analyzed by microcapillary LS-MS/MS as described in 
Gygi, S.P. et al. 1999 supra . The collision energy or the LCQ was set at 30%. 



10 Example 4 : Isolation of Phosphopeptides from [3-casein 

The methods of this invention were also used to purify and detect phosphopeptides 
from bovine p-casein, a well-characterized phosphoprotein. The peptide was labeled as 
described in Example 2. A tryptic digest of the phosphoprotein was analyzed by 
microcapillary LC-MS/MS. As shown in Fig. 4A, numerous peptides were observed for the 

15 untreated p-casein digest. The peptide indicated in Fig. 4A was a doubly charged ion at 

* " m/z=l 03 1 .6. When selected for fragmentation via collision induced dissociation (CID) (Fig. 
4C) (Papayannopoulos, I. A. (1995), "The interpretation of collision-induced dissociation 
tandem mass spectra of peptides," Mass Spectrometry Rev. 14,49-73), its fragment ion 
spectrum exhibited mostly the y-ion series typical for low energy peptide fragmentation and 

20 an additional major signal at m/z=983.0 corresponding to a loss of 98 Da due to the loss of 
the H 3 P0 4 group from the parent ion(Jonscher, K.R. and Yates, J.R. Ill (1997), "Matrix- 
assisted laser desorption ionization/quadrupole ion trap mass spectrometry of peptides. 
Application to the localization of phosphorylation sites on the P protein from Sendai virus," 
J. Biol. Chem. 272, 1735-1741; Qin, J. & Chait, B.T. (1997), "Identification and 

25 characterization of posttranslational modifications of proteins by MALDI ion trap mass 
spectrometry," Anal. Chem. 69, 4002-4009). Database searching of this CID spectrum 
identified a peptide with sequence FQS *EEQQQTEDELQDK (* denotes a phosphate group). 
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The mass difference between the yl3 and yl4 ions corresponded to that of phosphoserine, 
confirming Ser-50 of this protein as the known site of p-casein phosphorylation. 

Subjecting the same p-casein digest to the phosphopeptide isolation procedure 
reduced sample complexity considerably, yielding only one significant doubly charged 
5 peptide ion at m/z=l 182.5 (Fig. 4D). The CID spectrum of this peptide showed a clear 
fragment ion series and a major signal at m/z=\ 133.6 due to the loss of H 3 P0 4 (Fig. 4D). 
Database searching of this spectrum identified the same peptide as that in Fig. 4B. The 
increase in apparent mass for the same peptide (compare Fig. 4A with 4B, and 4C with 4D) is 
due to quantitative modification on all seven carboxylates (aspartic acid, glutamic acid, and 
10 the C terminus) with ethanolamine during the isolation procedure. 

Example 4 

Sample recovery efficiency for the steps of the invention was examined using a 
phosphotyrosine-containing peptide, the myelin-basic protein (MBP). MBP was 
phosphorylated in vitro using the catalytic domain of the tyrosine kinase Lck and 
L5 radiolabeled ATP (at a known specific activity). The phosphorylated peptide was digested 

- with trypsin and 5 pmol of phosphopeptides were isolated as before, except that the 

- carboxylate groups were blocked with d4-ethanol-amine. 

The ion chromatogram for the isolated phosphopeptides is illustrated in Fig. 5A where 
the most prominent ion at m/2 = 630.1 (2+) was chosen for fragmentation. The CID of this 

20 ion is shown in Figure 5B. This unambiguously identified the phosphopeptide as 

THY*GSLPQK of M.P.(Aebersold, R. et al. (1991), "Determination of the site of tyrosine 
phosphorylation a the low picomole level by automated solid-phase sequence analysis," Anal. 
Biochem. 199:51-60). Phosphopeptide recovery efficiency over the entire six-step procedure 
was assessed by measuring radioactive counts recovered after each step of the procedure, 

25 with a final yield that was consistently about -20% of the starting material. 
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In vitro kinase reaction 

17 |ag of a baculovirally expressed Lck kinase domain-GST (glutathione 5- 
transferase) fusion protein, 20 jag MBP and 10 u€i of 32 P-containing ATP were incubated at 
30°C for 1 hour in 40 ul of buffer containing 25 mM Tris (pH 7.5), 10 mM MnCl 2 , 0.25 mM 
5 ATP. After 1 hour, 6 M urea was added to stop the reaction. Reduction and alkylation were 
performed by the addition of dithiothreitol (DTT) at 10 mM for 30 minutes, followed by 2 
hours of incubation at 50 mM iodoacetamide. Samples were diluted with water to 1 ml, and 1 
ug of trypsin (Promega, Madison, WI) was added for 4 hours at 37°C. Peptides were then 
desalted on a CI 8 column (Waters, MA, cat. no. WAT023590) and recovered by elution with 
1 0 80% acetonitrile/0. 1% TFA. Recovered radiolabeled peptides were quantified via Cerenkov 
counting. From this, an estimated 5 pmol of phosphorylated peptides were taken for isolation 
of phosphopeptides and evaluation of recovery efficiencies. Deuterated d4-ethanolamine 
(Isotec, Miamisburg, OH) was used to block carboxylate groups in this experiment. 

s _ Example 5 : Phosphoprotein profiling in yeast 

1 5 Yeast 5". cerevisiae strain (BWG1-7A) was grown till mid-log phase on YPD media 

with 2% glucose as a carbon source and harvested by centrifugation. Protein extract was 
prepared by the glass beads method as described in Current Protocols in Molecular Biology 
(New York, J. Wiley). A mixture of DNAse 1 (20 U/ml) and RNAse (10 pg/ml) was added 
for 30 minutes on ice. Protein concentration was determined using Biorad protein assay and 

20 500 p.g of the protein extract was then denatured in 0. 1 M potassium phosphate buffer (pH 
8.0) with 6 M urea. Proteins were reduced and alkylated by addition of DTT (lOmM, 30 
min)followed by 2 hours of incubation with iodoacetamide. Samples were then dialyzed 
prior to digestion overnight with trypsin at 37 °C. The resulting peptide mixture was desalted 
by reverse phase CI 8 column as described above. Samples were treated as in Example 1. 

25 Phosphopeptides were isolated by this method and analyzed by LC-MS/MS, with CID 

spectra being recorded and searched against the yeast sequence database. Fig. 6 A shows the 
total ion intensity recorded with respect to retention time on the column, indicating the 
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complexity of the sample. Fig. 6B shows the m/z values observed integrated over the time 
window indicated in Fig. 6 A. The major peptide peaks observed which also displayed loss of 
98 Da during CID are labeled with an asterisk (*), confirming that the majority of the 
peptides detected were indeed phosphorylated. Further more, the selectivity of the method 
was apparent by virtue of the fact that over 80% of the CID spectra that led to identification 
were derived from phosphopeptides. Additionally, CID spectra derived from the few non- 
phosphorylated peptides identified generally resulted from lower intensity precursors ions. 
Thus even with a highly complex starting material, only low level of non-specific peptide 
background carried through the isolation procedure to the MS, affirming its selectivity. In the 
example shown, the ion at m/z=\ 032.7 in Fig. 6B was selected for CID, this spectrum being 
shown in Fig. 6C. In addition to observing a clear fragment ion series, a major signal 
corresponding to the doubly charged parent ion after undergoing the loss of H 3 P0 4 is 
apparent at ra/z=983.8. 

Following database searching, the peptide was identified as being from enolase and 
having the indicated sequence. This peptide contained three potential threonine 
phosphorylation sites, and the parent ion mass indicated that the peptide contained a single 
phosphate group. The y-7 to y-13 ions confirmed that the phosphate was not on the N- 
terminal threonine. Two possible pairs of the y-5 and y-6 ions correspond to either one of the 
other two threonine residues being phosphorylated. Thus the exact site of phosphorylation for 
this peptide could not be determined. Further, the possibility that a mixture of both possible 
mono-phosphorylated species of this peptide may have co-eluted from the LC column can not 
be excluded. 

Table 1 lists the proteins (and gene names) identified following database searching of 
the additional CID spectra obtained in the same manner, along with the sequences of the 
phosphopeptides determined. All peptides positively identified were singly phosphorylated 
species and they were phosphorylated at serine or threonine residues. Table 1 also indicates 
the location of the phosphorylation site within the peptide when this could be unequivocally 
determined, or gives the possible phosphorylation sites in cases in which the observed CID 
data could not distinguish between two or more phosphorylation sites. As was seen in Fig. 
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6C, such occasions did not interfere with the identification of the phosphopeptide, and the site 
of phosphorylation could typically be confined to a cluster of hydroxyl amino acids. 

No peptides with multiple serine or threonine phosphorylation sites were identified. In 
many cases, ions corresponding to loss of H 3 P0 4 dominated the fragmentation process, 
resulting in insufficient fragmentation at peptide bonds for sequencing. This effect would be 
compounded by multiple phosphoserine- or phosphothreonine sites in a single peptide. 
Peptides of too large or small sizes are generally not suitable for MS sequencing; 
additionally, such peptides could be lost during the desalting steps in this method. Whether 
this method could completely determine all the phosphorylation sites of a given protein 
therefore depends on whether phosphorylation sites are contained in peptides of suitable 
sizes/hydrophobicities for MS analysis, a limitation common to all MS-based methods. In 
such cases, alternative proteolytic enzymes can be considered. In the experiment on yeast 
lysate, tyrosine phosphorylated peptides were not identified, likely due to its significantly 
lower abundance. 

Most of the proteins identified were found to be glycolytic enzymes, including 
enolase, glyceraldehyde 3 -phosphate dehydrogenase, phosphoglycerate kinase and pyruvate 
kinase. Since the cells from which the proteins were isolated utilized glucose as a carbon 
source, the identification of phosphorylation sites on glycolytic enzymes as being the major 
species present in this sample is perhaps not surprising. Phosphopeptides from other highly 
expressed proteins, such as pyruvate decarboxylase isozyme 1, ribosomal proteins and heat 
shock proteins were also identified. Interestingly, most of the proteins listed in Table 1 were 
not annotated in the database as being known phosphoproteins. However, previous studies 
from our group and others identified many of the proteins listed in Table 1 in multiple 2D gel 
spots (Gygi, S.P. et al. (1999), "Correlation between protein and mRNA abundance in yeast," 
Mol. Cell. Biol. 19, 1720-1730; Futcher B. et al. (1999) Mol. Cell. Biol. 19, 7357-68), 
consistent with there being differentially phosphorylated forms of the same protein. These 2D 
gel data are thus consistent with the identifications made here, supporting the assertion that 
these proteins are indeed phosphorylated in vivo. Although phosphopeptides from lower 
abundant, regulatory proteins were not identified in this experiment, the method itself 
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nonetheless is compatible with larger scale sample preparation, or for analysis of enriched 
protein complexes of interest. Subsequent fractionation either prior to or following this 
isolation should greatly facilitate identification of lower abundance proteins. 

LC-MS/MS and database analysis 

An LCQ ion trap mass spectrometer (Finnigan MAT, CA) was used with a HP 1 100 
solvent delivery system (Agilent, CA). Peptides were pressure-loaded onto the column, then 
eluted and analyzed by microcapillary LC-MS/MS as described previously (Gygi, S.P. et al. 
(1999), "Quantitative analysis of complex protein mixtures using isotope-coded affinity tags," 
Nat. Biotechnol. 17, 994-999). The collision energy for the LCQ was set at 30%. SEQUEST 
(Eng, J. et al. (1994), "An approach to correlate tandem mass spectral data of peptides with 
amino acid sequences in a protein database," J. Am. Soc. Mass Spectrom. 5, 976-989) 
was used for searching yeast YPD database for peptide sequence and identification of site(s) 
of phosphorylation. Search parameters included differential mass modification to serine, 
threonine and tyrosine due to phosphorylation, static mass modification to aspartate, 
glutamate and C-terminus due to the ethanolamine tag, and static mass modification to 
cysteine due to alkylation by iodoacetamide. Differential mass modification means both 
possibilities of modified or unmodified amino acid residues were used in database search, 
while static mass modification means only the modified amino acid residue was used. 

Using procedures similar to those outlined above, phosphopeptides were isolated and 
identified from Jurkat cells. A list of phosphopeptides identified in these experiments is 
provided in Table 2. 

Those of ordinary skill in the art will appreciate that protective groups, labels, 
reagents, solid phase materials, acid treatments (mild or strong), isotopic labels, purification 
and washing procedures other than those specifically disclosed herein can be employed to 
practice the method of this invention. A variety of functional equivalent reagents, methods 
and techniques in addition to those specifically disclosed herein are known in the art and can 
be readily employed or adapted without expense of undue experimentation to the practice of 
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this invention. All art-known functional equivalents and known variants of the materials and 
methods specifically used herein are intended to be encompassed by this invention. 

All references cited herein are incorporated by reference herein to the extent that they 
are not inconsistent herewith. 
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Table 1. Phosphopeptide profile from yeast grown with glucose as a carbon source. 





Gene name a 


Protein name 


Phosphopeptide identified b 




EN01_yeast& 
EN02_yeast 


Enolase 


TAGIQIVADDLT* VT*NP AR c 
IGLDCAS*S*EFFK c 
SGET*EDT*FIADLVVGLR c 


5 


G3Pl_yeast 


Glyceraldehyde 3- 

phosphate 

dehydrogenase 


LVSWYD N EYG YS*T*R c 

VIS*NASCTTNCLAPLAK 

TASGNIIPSST*GAAK 




DCPl_yeast 


Pyruvate decarboxylase 
isozyme 1 


iNr VIJjAJJAV^V^o K. 

TPANAAVPAS*T*PLK c 


CO 


KPYl_yeast 


Pyruvate kinase 1 


GVNLPGTDVDLPALS*EK 
GVNLPGT*DVDLPALSEK 


I 

= 1 


PGK_yeast 


Phospho gly cerate 
kinase 


DVT*FLNDCVGPEVEAAVK 
VLENT*EIGDSIFDK 
EGIP AGWQGLDNGPES *R 

a C \ C *"\ 7TT T T3"\TT I? 

AsAruS V LLLxirN LK 
ELPGV AFLS *EK 




PGMl_yeast 


Phosphoglycerate 
mutase 1 


SFDVPPPPIDASSPFS*QK 
VYPDVLYT*S*K c 


10 


ALF_yeast 


Aldolase 


FAIPAINVT*S*S*S*T*AVAALEAAR c 




G6PI_yeast 
HS75_yeast 


Glucose- 6-phosphate 

isomerase 

Heat shock protein 


EANVT*GLR 
SQIDEVVLVGGS*T*R c 




HS72_yeast 


Heat shock protein 


TTPSFVGFTDT*ER 




RL1 l_yeast 


60s ribosomal protein 


VLEQLSGQTPVQS*K 


15 


Rl 41 yeast 


40s ribosomal protein 


IEDVTPVPS*DS*T*R c 
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a Gene names are according to Swiss-Prot nomenclature ( www.expasy.cfa ). 

b Sequence and site of phosphorylation was identified by SEQUEST 18 (see text). 

0 Multiple marks of asterisk indicates ambiguity on the exact site of phosphorylation. All 

peptides are singly-phosphorylated. 

indicating site of phosphorylation at the serine or threonine residue to its left. 
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Table 2: 



Results on phosphoproteins in Jurkat Cells 



Human GAP SH3 binding protein 

Tumor necrosis factor type 1 receptor 
associated protein 

5 Thyroid hormone receptor-associated 
protein complex component TRAP 150 
mRNA 

Stathmin (phosphoprotein pi 9) 
Alpha enolase 
1 0 Glyceraldehyde 3 -phosphate dehydrogenase 

Pyruvate kinase 

L-lactate dehydrogenase h chain 



S S SP AP ADI AQT VQEDLR 
GVVDSEDIPLNLSR 

ASAVSELSPR 

ASGQAFELILSPR 

AAVPSGASTGIYEALELR 

VPTANVSVVDLTCR 

NTGIICTIGPASR 

VIGSGCNLDSAR 
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